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Herbrand Consistency of Some Arithmetical Theories 



Abstract 

Godel's second incompleteness theorem is proved for Herbrand consistency of some arithmetical 
theories with bounded induction, by using a technique of logarithmic shrinking the witnesses of 
bounded formulas, due to Z. Adamowicz [Herbrand consistency and bounded arithmetic, Fundamenta 
Mathematicae 171 (2002) 279-292]. In that paper, it was shown that one cannot always shrink the 
witness of a bounded formula logarithmically, but in the presence of Herbrand consistency, for theories 
IAq + Om with 771 ^ 2, any witness for any bounded formula can be shortened logarithmically. This 
immediately implies the unprovability of Herbrand consistency of a theory T 5 IAq + ^^2 in T itself. 

In this paper, the above results are generalized for IAq + ^i- Also after tailoring the definition 
of Herbrand consistency for IAq we prove the corresponding theorems for IAq. Thus the Herbrand 
version of Godel's second incompleteness theorem follows for the theories IAq + ili and IAq. 
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1 Introduction 



By Godel's first incompleteness theorem Truth is not the same as Provability in sufficiently strong theories. 
In other words, Provable is a proper subset of True, and thus True is not conservative over Provable. It is not 
even Hi— conservative; i.e., there exists a Hi— formula, in theories which can interpret enough arithmetic, 
which is true but unprovable in those theories. Thus one way of comparing the strength of a theory T 
over one of its sub-theories S is considering the IIi— conservativeness of T over 5*. And Godel's second 
incompleteness theorem provides such a Hi— candidate: Con(S'), the statement of the consistency of S. 
By that theorem 5* 1/ Con(S'), but if T h Con(5') then T is not IIi— conservative over S. 

Examples abound in mathematics and logic: Zermelo-Frankel Set Theory ZFC is not Hi— conservative 
over Peano's Arithmetic PA, because ZFC h Con(PA) but PA 1/ Con(PA). Inside PA the S„— hierarchy 
is not a Hi— conservative hierarchy, since ISn+i l~ Con(ISn) though ISn 1/ Con(lSn); see e.g. [7]. Then 
below the theory lEi things get more complicated: for IIi— separating IAq + Exp over IAq the candidate 
Con(IAo) does not work, because IAq + Exp 1/ Con(lAo). For this Hi— separation, Paris and Wilkie [10] 
suggested the notion of cut-free consistency instead of usual - Hilbert style - consistency predicate. Here 
one can show that IAq + Exp h CFCon(IAo), and then it was presumed that IAq 1/ CFCon(IAo), where 
CFCon stands for cut-free consistency. But this presumption took a rather long time to be established. 
Meanwhile, Pudlak in [11] established the Hi— separation of IAq + Exp over IAq by other methods, and 
mentioned the unprovability of CFCon(IAo) in IAq as an open problem. This problem is interesting in its 
own right. Indeed Godel's second incompleteness theorem has been generalized to all consistent theories 
containing Robinson's Arithmetic Q, in the case of Hilbert consistency; see [7]. But for cut-free consistency 
it is still an open problem whether the theorem holds for Q, and its not too strong extensions. This is a 
double strengthening of Godel's second incompleteness theorem: weakening the theory and weakening the 
consistency predicate. Let us note that since cut-free provability is stronger than usual Hilbert provability 
(with a super-exponential cost), then cut free consistency is a weaker notion of consistency. Indeed, proving 
Godel's second incompleteness theorem for weak notions of consistencies in weak arithmetics turns out to 
be a difficult problem. We do not intend here to give a thorough history of this ongoing research area, let 
us just mention a few results: 

• Z. Adamowicz was the first one to demonstrate the unprovability of cut free consistency in bounded 
arithmetics, by proving in an unpublished manuscript in 1999 (later appeared as a technical report [1]) 
that the tableau consistency of IAq + fii is not provable in itself. Later with P. Zbierski (2001) she proved 
Godel's second incompleteness theorem for Herbrand consistency of IAq + ^2 (see [2]), and a bit later she 
gave a model theoretic proof of it in 2002; see [5]. 

• D. E. Willard introduced an IAq— provable Hi— formula V and showed that any theory whose axioms 
contains Q + V cannot prove its own tableaux consistency. He also showed that tableaux consistency of 
IAq is not provable in itself, see [T^flS] ; this proved the conjecture of Paris and Wilkie mentioned above. 

• S. Salehi (see [13] Chapter 3 and also [12]) showed the unprovability of Herbrand consistency of a 
re-axiomatization of IAq in itself, the proof of which was heavily based on [2] . The re-axiomatization used 
PA~, the theory of the positive fragment of a discretely ordered ring, as the base theory, instead of Q, and 
assumed two IAq— derivable sentences as axioms. Also the model-theoretic proof of Z. Adamowicz in [3] 
was generalized to the IAq + fli case in Chapter 5 of JI^. A polished and updated proof of it appears in 
the present paper. 

• L. A. Kolodziejczyk showed in [8J that the notion of Herbrand consistency cannot Hi— separate the 
hierarchy of bounded arithmetics (this Hi— separation is still an open problem). Main results are the 
existence of an n for any given m ^ 3 such that Sm 1/ HCon(S'^), and the existence of a natrual n such 
that [J^Sm l/HCon(S'3), where HCon stands for Herbrand consistency. 
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• Z. Adamowicz and K. Zdanowski have obtained some results on the unprovability of the relativized notion 
of Herbrand consistency in theories containing IAq + fii; see [Ij. Their paper contains some insightful ideas 
about the notion of Herbrand consistency. 

For lAo + fii the arguments are rather smoother, in comparison to the case of IAq. Our proof for the 
main theorem on IAq + fii borrows many ideas from [3], the major difference being the coding techniques 
and making use of a more liberal definition of Herbrand consistency. The definition of HCon given in |2] 
and [3] depends on a special coding given there. For reading the present paper no familiarity with [2] is 



needed, but a theorem of [3J will be of critical use here (Theorem 22). We will even use a modified version 



of it (Theorem 37). For IAq we will see that our definition of HCon is not best suited for this theory; and 
we will actually tailor it for IAq. A hint for the obstacles in tackling Herbrand consistency in IAq can be 
found in Chapters 3 and 4 of [T3j . 

In Section 2 we introduce the ingredients of Herbrand's theorem from the scratch, and then explain 
how they can be arithmetized by Godel coding. This sets the stage for Section 3 in which we formalize 
the notion of Herbrand model and use it to prove our main theorem for IAq + fii. Finally in Section 4 
we modify our definitions and theorems to fit the IAq case. After pinpointing the places where we have 
made an essential use of ^i, we do some tailoring for IAq, and prove our main result for IAq. We finish 
the paper with some conclusions and some open questions. 

2 Basic Definitions and Arithmetizations 

This section introduces the notions of Herbrand provability and Herbran consistency, and a way of for- 
malizing and arithmetizing these concepts. The first subsection can be read by any logician. The second 
subsection gets more technical with Godel coding, for which some familiarity with [7] is presumed. 

2.1 Herbrand Consistency 

Skolemizing a formula is usually performed on prenex normal forms (see e.g. [6]), and since prenex 
normalizing a formula is not necessarily done in a unique way, then one may get different Skolemized 
forms of a formula. For example, the tautology F = Vx0(x) — )■ Vx0(x) can be prenex normalized into 
either \/x3y{(f>{y) — > 0(x)) or 3yWx{(f){y) — )■ (f>{x)). These two formulas can be Skolemized respectively as 
(j){f{x)) — )■ (f){x) and 0(c) — > 0(x), where f is a new unary function symbol, and c is a new constant symbol. 
Here we briefiy describe a way of Skolemizing a (not-necessarily prenex normal) formula which results in 
a somehow unique (up to a variable renaming) formula. 

A formula is in negation normal form when the implication symbol does not appear in it, and the 
negation symbol appears in front of atomic formulas only. A formula can be (uniquely) negation normalized 
by the following rewriting rules: 

(A-^ B)^^ {^A V B) ^^A ^ A 

-n{Ay B)^^{^AA^B) ^(AAB)^^ (^Ay^B) 

-i^xA{x) \=^ 3x^A{x) -'3xA{x) \=^ \/x^A{x) 

A formula is called rectified if no variables appears both bound and free in it, and different quantifiers 
refer to different variables. A formula is called rectified negation normal if it is both negation normalized 
and rectified. Again, any formula can be rectified. Indeed, any given formula is equivalent to its rectified 
negation normal form (RNNF) which can be obtained from the formula in a unique (up to a variable 
renaming) way (see e.g. |S]). 

^Sa&fe"^'" http://saeedsalehi.ir/pdf/licon.pdf (c) Saeed Salehi 2010 
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Now we introduce Skolem functions for existential formulas: for any (not necessarily RNNF) formula of 
the form 3xA{x), let f3xA{x) be a new tti— ary function symbol where m is the number of the free variables 
of 3xA{x). When m = then f3xA{x) will obviously be a new constant symbol (cf. [6]). 

Definition 1 Let ip be an RNNF formula. Define ip^ by induction: 

• {p^ = ip for atomic or negated- atomic ip; 

• (ip o tp)^ = (^s Q ^s £qj^ q ^ |y\^ Yj gj^^ RNNF formulas ip, ip; 

• (yxip)^ = Vx(y9^; 

• (3xip)^ = 'P^[f3xif{x){y)/x] where y are the free variables of 3x(p{x) and the formula 'P^[f3x<fi{x){,y)/x] 
results from the formula ip^ by replacing all the occurrences of the variable x with the term f3x<fi{x){,y)- 

The Skolemized form of any (not necessarily RNNF) formula ip is obtained in the following way: using 
the above rewriting rules we negation normalize ip and then rename the repetitive variables (if any) to 
get a rectified negation normal form of ip, say ip. Then we get ip^ by the above definition, and remove 
all the (universal) quantifiers in it (together with the variables next to them). We denote thus resulted 
Skolemized form of ip by ip^^. O 

Note that ip^^ can be obtained from tp in a. unique (up to a variable renaming) way, and it is an open 
(quantifier-less) formula. For the above example F, assuming that is atomic, we get 

F^ = (3x^0(x) V Vx0(x))5 = ^0(c) V Vx0(x), 
and thus F^^ = ^0(c) V 0(x) = 0(c) -^ (p{x). 

Definition 2 An Skolem instance of the formula ip is any formula resulted from substituting the free vari- 
ables of ip^^ with some terms. So, iixi, . . . ,Xn are the free variables of ip^^ (thus written as tp^^{xi, . . . , Xn)) 
then an Skolem instance of ip is ip^^[ti/xi, ■ ■ ■ , t„/x„] where ti, . . . ,tn are terms (which could be constructed 
from the Skolem functions symbols). 

Skolemized form of a theory T is by defintion T^^ = {ip^^ \ v^gT}. O 

We are now ready to state an important theorem discovered by Herbrand (probably by also Skolem 
and Godel). This theorem has got some few names, and by now is a classical theorem in Mathematical 
Logic. Here we state a version of the theorem which we will need in the paper. The proof is omitted, 
though it is not too difficult to prove it directly (see e.g. |5j). 

Theorem 3 (Herbrand) Any theory T is equiconsistent with its Skolemized theory T^^. In other words, 
T is consistent if and only if every finite set of Skolem instances of T is (propositionally) satisfiable. O 

We will use the above theorem, which reduces the consistency of a first-order theory to the satisfiability 
of a propositional theory, for the definition of Herbrand Consistency: a theory T is Herbrand consistent 
when every finite set of Skolem instances of T is propositionally satisfiable. One other concept is needed 
for formalizing Herbrand consistency of arithmetical theories: evaluation. 

Convention 4 Throughout the paper we deal with closed (or ground) terms (i.e., terms with no variable) 
and for simplicity we call them "term". For this to make sense, we may assume that the language of the 
theory under consideration has at least one constant symbol. O 

(c) Saeed Salehi 2010 http://saeedsalehi.ir/pdf/licon.pdf ^Ea^efe ■^'' 
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Definition 5 An evaluation is a function whose domain is the set of all atomic formulas constructed from 
a given set of terms A and its range is the set {0,1} such that 

(i) p[t = t] = 1 for all tGA; and for any terms t, sGA, 

(ii) if p [t = s] = 1 then p [^{t)] = p [f{s)] for any atomic formula <f{x). 

The relation ^p on A is defined by t ^p s <^=^ p[t = s] = 1 for t, s G A. O 

Lemma 6 The relation ^p defined above is an equivalence relation. 

Proof. For (p{x) = {s = x) from p[t = s] = 1 one can infer p[s = t] = p [^p{t)] = p [^p{s)] = p[s = s] = 1. 
So, t -^p s implies s ^p t. Also for 0(a;) = {t = x) the condition p [s = r] = 1 implies p\t = s\ = p [4>{s)] = 
p [(f){r)] = p[t = r]. So, ^p is a symmetric and transitive (also, by definition, a refiexive) relation. @ 

Notation 7 The ^p— class of a term t is denoted by t/p; and the set of all such p— classes for each t G A 
is denoted by A/ p. 

For simplicity, we write p \= (f instead of p [y^] = 1; thus p ^ y^ stands for p [ip] = 0. This definition of 
satisfying can be generalized to other open formulas in the usual way: 

• p \= if Alp ii and only ii p \= if and p \= ijj] 

• p \= ip\/ ip ii and only ii p \= ip or p \= ip; 

• p \= -i^p if and only ii p ^ if. O 

Let us note that ^p is a congruence relation as well. That is, for any set of terms ti and Sj {i = 1, . . . , n) 
and function symobl f , ii p \= ti = Si A ■ ■ ■ A tn = Sn then p \= f{ti, . . . ,tn) = /(si, . . . ,Sn)- 

Definition 8 If all terms appearing in an Skolem instance of belong to the set A, that formula is called 
an Skolem instance of available in A. 

An evaluation defined on A is called a 0— evaluation if it satisfies all the Skolem instances of which are 
available in A. 

Similarly, for a theory T, a T— evaluation on A is an evaluation on A which satisfies every Skolem instance 
of every formula of T which is available in A. O 

For illustrating the above concepts we now present an example. 

Example 9 Take the language C = {g, P, R, S} in which g is a. binary function symbol, and P is a binary 
predicate symbol, and R, S are unary predicate symbols. Let the theory T be axiomatized by: 

Ti: \/x3yPix,y); 
^, T2: Wx{R{x)\/S{gx))- 

T3: yx,y{-nP{x,y)y^Six)). 
Let us, for the sake of simplicity, denote f3yp[x,y) by f ; then the Skolemized form of the above theory is: 

Tfk; P(x,fx); Tf'^: P(x) V 5(^x); Tf^ : -^P{x,y) W ^S{x). 

For a constant symbol c let A = {c,gc,fc}. Then P(c, fc) and P(c) V S{gc) are Skolem instances of T (of 
Ti and T2) available in A, but the Skolem instance R{gc) V S{ggc) of T2 is not available in A. Let us note 
also that the Skolem instance -iP{gc, fgc) V -iS{gc) of T3 is not available in A. 

^Sa&fe"^'" http://saeedsalehi.ir/pdf/licon.pdf (c) Saeed Salehi 2010 
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Let q be an evaluation on A whose set of true atomic formulas is {P{c, fc), -R(c)}. Then g is a T— evaluation. 
On the other hand the evaluation r on A whose set of true atomic formulas is {P{c, fc), -R(c), S{c)}, is not 
a T— evaluation, though it satisfies all the Skolem instances of Ti and T2 which are available in A. Note 
that r does not satisfy the Skolem instance ~i-P(c, fc) V ^S{c) of T3. O 

By the above theorem of Herbrand, a theory T is consistent if and only if every finite set of its Skolem 
instances is satisfiable, if and only if for every finite set of terms A there is a T— evaluation on A. And for a 
formula yj, T h yj if and only if there exists a finite set of terms A such that there is no (T + -i(/?)— evaluation 
on A. We call this notion of provability, Herbrand Provability] note that then Herbrand Consistency of a 
theory T means the existence of a T— evaluation on any (finite) set of terms. 

Example 10 In the previous example, let ip = ^xR{x). We show T \- (phy Herbrand provability. Write 
-lip = 3x^R{x), and let c denote the Skolem constant symbol f3x^R(x)', so we have {^(p)^^ = -i-R(c). Put 
A = {c, gc, fgc}, and assume (for the sake of contradiction) that there is a (T+-i(^)— evaluation p on A. Then 
p must satisfy the following Skolem instances of T in A: P{gc, fgc), R{c) V S{gc), and -^P{gc, fgc) V -^S^gc). 
Whence p must also satisfy -^S^gc) and R{c). So p cannot satisfy the Skolem instance ~i-R(c) of -iip in A. 
Thus there cannot be any (T + -lys)— evaluation on A; whence T \- ip. 

Note that finding an appropriate A is as complicated as finding a formal proof. For example we could not 
have taken A as {c, gc, fc}, since the evaluation q in the previous example would be a (T + -k/?)— evaluation 
on that set. O 

The following couple of examples give a thorough illustrations for the above ideas, and they will be 
actually used later in the paper. 

Example 11 Let Q denote Robinson's Arithmetic over the language of arithmetic {0,5, +, ■, ^), where 
is a constant symbol, s is a unary function symbol, +, ■ are binary function symbols, and ^ is a binary 
predicate symbol, whose axioms are: 



Al 


Vx(sx ^ 0) 


^3 


yx{x 7^ -)■ 3y[x 


A, 


Va;(x + = x) 


A7 


Vx(x -0 = 0) 



sy]) 



A2 

A, 

A, 



yx\fy{sx = sy ^ X = y) 
\/x\/y{x ^ y ^ 3z[x + z = y]) 
Wx\/y{x + sy = 5{x + y)) 
WxWy{x ■ sy = X ■ y + x) 



Let i/j = \/x{x ^ — 7- X = 0) and ip = WxWy{x ^ sy ^ x = sy \/ x ^ y). We can show Q\- ip and Q \- (p; 
these will be proved below by Herbrand provability. Suppose Q has been Skolemized as below: 



V. 



A? 



Sk 



AT 
Af 



SXy^O 

X = oy X ■■ 

X + = X 
x-0 = 



spx 



A 
A 



Sk 



Sk 



4Sk 

^6 



A 



Sk 



sx j^ sy y X = y 

[x ^ y y X + [){x , y) = y] A [x + z j^ y y X ^ y] 

X + sy = s{x + y) 

X ■ sy = X ■ y + X 



Here p abbreviates f3y{x=sy) and f) stands for f3z{x+z=y)- 

For a fixed term t, put S^ be the following set of terms: 

Si = {0,t,t + 0,[)(t,0),pf)(t,0),5p[)(t,0),t + 5p[)(t,0),5(t + 5pf)(t,0))}, 
and suppose that p is an Q— evaluation on S^. We show that p|=t^OVt = 0. Note that Skolemizing 
ijj results in ip^^ = (x ^ V x = 0). If p is such an evaluation and if p |= t ^ 0, then by A4 we have 
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p 1= t + f)(t,0) = 0. Now, either p |= f)(t, 0) = or p ^ f)(^,0) = 0. In the former case, we have 
p 1= t + = t which by A^ imphes p |= t = 0. In the latter case, by A^ we get p |= f)(t, 0) = sp[)(t, 0), and 
then p 1= = t + P)(t, 0) = t + spf)(t, 0) = s(t + pf)(t, 0)) by Ae? which is a contradiction with Ai. Thus we 
showed that if p |= t ^ then necessarily p\= t = Q. 

Now, for two fixed terms -u, v define r„^„ as 

^u,v = {0,M,f,St;, \){u,5v),p\){u,5v),5p\){u,5v),U + pP)(-U,Sf), 
U +5pf)(u,Sf),5(M + pf)(u,Sf))}. 

We show that any Q— evaluation on T^^v must satisfy u^sv\/u = sv\/u<v. Note that the Skolemized 
form of ip is ip^^ = {x ^ sy \/ x = 5y \/ x ^ y) . Suppose p is an Q— evaluation on r„^^. Then either 
p 1= i){u,5v) = or p 1= l){u,sv) ^ 0. In the former case, by A4, we have p|=M^SfV-u + = Sf , and then 
by ^5, p 1= M ^ St; V M = 5^;. And in the latter case p |= \)(u,5v) = 5pi){u,5v) by ^3, also by A4 we have 
p \= u ^ sv y u + 5pi){u, sv) = 5v. On the other hand from A^ we get p \= u + spi){u,sv) = 5{u + pl){u,sv)). 
Whence we get p \= u ^ 5v y 5{u + pf)(M,Sf)) = sv, then by A2, p \= u ^ 5v \/ u + pl){u,5v) = v, which by 
A4 implies p\=u^5v\/u^v. Hence, in both cases we showed p\=u^sv\/u = 5v\/u^v. Finally, let 
us note that one could present a Herbrand proof oi Q,\- ip and Q,\- (p very similarly. O 



Example 12 In the language of Example 11 , (0,s, +, ■, ^), let ind^ be the following induction scheme for 
the formula il){x): 

■?/'(0) A \/x[i'{x) -)■ ipisx)) -)■ Wxip{x). 

Assume for the moment that ip is an atomic formula. Then the Skolemization of ind^ results in ind^ : 

-'ipiO) V [il>{c) A -i'?/'(sc)) V ipix), where c is the Skolem constant symbol f / \. Then any 

ind^— evaluation p on the set of terms {0, c,5c, t} must satisfy one of the following: 

either (1) p ^ V^(0) or (2) p |= ^'(c) A -.^(sc) or (3) p \= i){t). 

Now take ip{x) to be the existential formula 3yip{x, y) in which ip is an atomic formula. Then the Skolemized 
form of ind^ will be as 

ind^'^ : ^ip{Q,u) V f(^(c, qc) A -1^9(30, f)j V ip{x,c\{x)), 

where q is the Skolem function symbol for the formula 3y(p{x,y), and c is the Skolem constant symbol for 
the sentence 3x(3wip{x,w) A Vf-i(y9(sx,f)). The variables u, v and x are free. 

We will need the case oi ip{x,y) = (^y^x-xAy = x-x^ in the proof of Theorem 



the Skolemized form of ind^ is 



38 



below. In this case 



(m ^ 0^ V M ^ 0^) Y 

('(qc ^ c^ A qc = c^) A [v ^ [scY V 1; ^ {scf)) V 

(q(x) < x^ A q(x) = x^). 



The notation g'^ is a shorthand for g ■ q. Define the set of terms T by 

T = {0, + 0, 02, c, c2, e + 0,sc, qc, (sc)^, [stf + 0} 
and suppose p is an {Q + ind^)— evaluation on the set of terms T U {t,t^, q(t)}. Then p must satisfy the 
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following Skolem instance (S) of ind^ which is available in the set T U {t,t^, c{{t)}'- 

(0 ^ 0^ V ^ 0^) Y 

((qc ^ c^ A qc = c^) A {{scf ^ (sc)' V {scf ^ (sc)^)) \j 

(q(t)^t2Aq(t)=t2). 

Now since p|=0-0 = + = then, by Q's axioms, p|=O^O^AO = 0^, and so p cannot satisfy the 
first disjunct of (5). Similarly, since p |= (sc)^ + = (sc)^ then p |= (sc)^ ^ (-sc)^, thus p cannot satisfy the 
second disjunct of (5) either, because p |= (sc)^ = (sc)^. Whence, p must satisfy the third disjucnt of (S), 
then necessarily p |= l(^) = ^^ must hold. O 



In Example 11 we used the axioms of Robinson's Arithmetic Q to derive two sentences that will be needed 



later (see Lemma 25). In Example 12 we used an axiom of IAq to derive the existence of an squaring 



Skolem function symbol (see the proof of Theorem 38 ) 



Remark 13 The arguments of the above two examples can be generalized as follows: if T h ^x6{x) where 
9 is an open (quantifier-less) RNNF formula, then {-NxOijic))^^ = ~i0{c) in which c is a sequence of Skolem 
constant symbols. There exists a set of terms F (constructed from the Skolem function and constant 
symbols of T with c) such that there exists no (T + -iVx^(x))— evluation on F. So, for any sequnece of 
terms t, if F(t) is the set of terms which result from the terms of F by substituting c with i, then any 
T— evaluation on F(t) must satisfy the formula 9{t). O 

2.2 Arithmetization 

Fix Ca to be our language of arithmetic; one can set Ca = (0, 1, +, ■, <) as e.g. in [9] or Ca = (0,5, +, ■, ^) 
as e.g. in [7j. Later it will be clear that choosing this fixed language is not of much importance. 



Peano's arithmetic PA is the first-order theory that extends Q (see Example 11) by the following 
induction schema for any arithmetical formula ip{x): f{0) A \/x(^ip{x) -^ ip(x + 1)) — )■ Wxip{x). Fragments 
of PA are extensions of Q with the induction schema restricted to a class of formulas. A formula is called 
bounded if its every quantifier is bounded, i.e., is either of the form Va; ^ t(. . .) or 3x ^ t{. . .) where t is 
a term; they are read as Vx(x ^ t — t- . . .) and 3x(s ^ t A . . .) respectively. It is easy to see that bounded 
formulas are decidable. The theory IAq, also called bounded arithmetic, is axiomatized by Q plus the 
induction schema for bounded formulas. The exponentiation function exp is defined by exp(a;) = 2^; the 
formula Exp expresses its totality: (\/x3y[y = exp{x)]) . The converse of exp is denoted by log which is 
formally defined as logx = min{?/ | x ^ exp{y)}; and the cut iog consists of the logarithms of all elements: 
£og = {x I 3y[exp{x) = y]}- The superscripts above the function symbols indicate the iteration of the 
functions: exp^(a;) = exp(exp(x)), log^x = log log x; similarly the cut iog"^ is {x \ 3|/[exp"(x) = y]}. Let us 
recall that Exp is not provable in IAq; and sub-theories of IAq + Exp are called weak arithmetics. Between 
IAq and IAq + Exp a hierarchy of theories is considered in the literature, which has close connections with 
computational complexity. Define the function cOm to be ujm{x) = exp™ ((log*" x) ■ (log"^ x)). It is customary 
to define this function by induction: uo{x) = x^ and ijJn+i{x) = exp(a;n(logx)). Let Q^ express the totality 
of Wm (i.e., fim = Wx3y[y = Um{x)]). 

By Godel's coding method, we are now rest assured that the concepts introduced in the pervious 
section all can be formalized (and arithmetized) in the language of arithmetic. But we need just a bit 
more; and that is an "effective" coding, suitable for bounded arithmetic. The one we adopt here is taken 
from Chapter V of [7]. For convenience, and shortening the computations, we introduce the V notation. 
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Definition 14 We say x is of V{y), when the code of x is bounded above by a polynomial of y; and we 
write this as '"a;"' ^ Vly), meaning that for some n the inequality '"x"' ^ y" + n holds. O 

Let us note that X ^ VlY) is equivalent to the old (more familiar) O— notation "logX G (9(logy)". 
Here we collect some very basic facts about this fixed efiicient coding that will be needed later. 

Remark 15 Let A be a set or a sequence of terms, and let |y4| denote the cardinality of A, when A is a 
set, and the same |y4| denote the length of A, when A is a sequence. Then 

• ^{ay ^ 9{^a^ + 1)2 (Lemma 3.7.2 page 297 of [7J); 

• ^A^B^ {^A U B^) ^ 64 • (^A^ . r^n) (Proposition 3.29 page 311 of 0); 

• {\A\) ^ (log ^A'^) (Definition 3.27 and Section (e) pages 304-310 of [7]); 

where '"A^i?"' is the concatenation of (the sequences) A and B. O 



If we let C^ to be the closure of Ca under Skolem function and constant symbols, i.e., let C^ be 
the smallest set that contains Ca and for any £^'^— formula 3x0(x) we have f3x(f>{x) ^ C^, then this 
new countable language can also be re-coded, and this recoding can be generalized to £^'^— terms and 
£^'^— formulas. We wish to compute an upper bound for the codes of evaluations on a set of terms A. For 
a given A, all the atomic formulas, in the language Ca, constructed from terms of A are either of the form 
t = s or of the form t ^ s for some t, sG A. And every member of an evaluation p on A is an ordered pair 
like {t = s, i) or {t ^ s, i) for some t, s G A and z G {0, 1}. Thus the code of any member of p is a constant 
multiple of ('"t"' ■ '"s"')^, and so the code of p is bounded above by ^(11* ssa'"^"' ' ^ ^~^)- 

Lemma 16 For a set of terms A and evaluation p on it, '"p"' ^ V ((X'i('~A~')). 

Proof. It sufiices, by the above remark and what was said afterward, to note that Ylt ssa'"^"' ' '~^~^ ~ 

UteAi^ty^^^ = iUteA^ty^^^ ^ p(r^n)2logrA- ^ p(r^nlogrA-) ^^^ ^^^^ r^nlogrAn ^ ^^(r^n)_ q 

Let us have another look at the above lemma, which is of great importance. For a set of terms A, there 
are |A| terms in it (the cardinality of A). So, there are 2|Ap atomic formulas constructed from the terms 
of A (atomic formulas of the form t = s or t ^ s for t, s G A). And thus, there are exp(2|Ap) different 
evaluations on the set A. And finally note that by | A| ^ (log'~A~') we get exp(2|Ap) ^ V (exp((log'~A~')2)) ^ 
V (wi('~A~')). So, in the presence of aJi('~A~') we have all the evaluations on A in our disposal. 

All these concepts can be expressed in the language of arithmetic Ca by appropriate formulas. And 
"Herbrand Consistency of the theory T" can be arithmetized as "for every set of terms there exists an 
T— evaluation on it". Let HCon(T) denote the £a— formula "T is Herbrand consistent". 

3 Herbrand Models 

For a theory T, when A is the set of all terms (constructed from the function symbols of the language of 
T and also the Skolem function symbols of the formulas of T) any T— evaluation on A induces a model 
of T, which is called a Herbrand model. Here we use this notion for building up a definable inner model, 
which will constitute the hear of the proof of our main result for IAq + f^i. 
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3.1 Arithmetically Definable Herbrand Models 

In the sequel, we arithmetize Herbrand models. 

Definition 17 Let £ be a language and A be a set of (ground) terms (constructed by the Skolem constant 
and function symbols of £). 

Put A^*^^ = A, and define inductively 

A(^'+i> = A(fc> u {/(ti, . . . ,t„0 I /g£ & ti, . . . ,t„GA<'=)} 

U{f3.^(.)(ti,...,t™) I r^;^^ A;&ti,...,t^eAW}. 

Let A<°°> denote the union U^.^j^A^'^^ 

Suppose p is an evaluation on A^°°^ Define nT(A,p) = {t/p \ tGA^°°^} and put the £— structure on it by 

. /^(^■^)(ti/p, . . . , tjp) = fih, ..., t^)/p, and 

• i?^(^'^^ = {{h/p, ..., tjp) I p h R{tu . . . , t^)}; 

for /,i?e£ and ti,...,t„GA<°°>. O 

Lemma 18 The definition of £— structure on in(A,p) is well-defined, and when p is an T— evaluation on 
A<°°>, for an £-theory T, then m(A,p) ^ T. 

Proof. That the definitions of /^(^'P) and i?^(^'P) are well-defined follows directly from the definition 
of an evaluation (Definition pi). By the definition of A^°°^ the structure in(A,p) is closed under all the 
Skolem functions of £, and moreover it satisfies an atomic (or negated atomic) formula Aitijp, . . . , tm/p) 
if and only if p |= A{ti, . . . ,tm)- Then it can be shown, by induction on the complexity of formulas, that 
for every RNNF formula ijj, we have Yn{A,p) \= ip whenever p satisfies all the available Skolem instances 
of?/;inA<~>. @ 

We need an upper bound on the size (cardinal) and the code of A^-'^ 

Lemma 19 The following inequalities hold when '"A"' and |A| are sufficiently larger than n: 

(1) |A<")| <P(|A|"'), and 

(2) rAMn^p((rAn)|A|'"-^'y 

Proof. Denote ^A^'^)^ by Afc (thus ^A^ = Aq = A) and \A^^^\ by ak (and thus |A| = ao = a). We first note 
that CTfc+i ^ (Jk + Ma^^ + ka^ for a fixed M. Thus ak+i ^ 'Picr^'^^), and then, by an inductive argument, 
we have o"„ ^ 7^(0""'). For the second statement, we first compute an upper bound for the code of the 



Cartesian power A"^ for a set A. By an argument similar to that of the proof of Lemma 16, we have 
r^fc+in ^ P(nteAfc&,6A^^^ " ^^^) ^ P(^A'=^I^I ■ ^A^l^l'), and thus ^A"^^ ^ ^(r^nlAr^ ^^^ ^^ ^^^^^ ^y 
induction on m. Now we have Afe+i ^ V{^A^'''^^ ■ r(A<'=))*^^ ■ r(AW)'=^) for a fixed M. So, Afc+i ^ P(A^''') 
and finally our desired conclusion Am ^ 'P(A'^ ™ ) follows by induction. @ 

Stating the above fact as a lemma, despite of the fact that it is indeed a crucial tool for our arguments, 
let us state the following corollary of it as a theorem, and later on we will use the theorem and will leave 
the lemma right here. 
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Theorem 20 If for a set of terms A with non-standard '"A"' the value a;2('~A~') exists, then for a non- 
standard j the value '"A^-'^"' will exist. 

Proof. There must exist a non-standard j such that j ^ log ('"A"'). Thus 2(j + 1)! ^2^^ ^ log '"A"'. Now, 



by Lemma 



19 



we can write rAO)n ^ p ((rAn)|A|0+i)^ j ^ ^ (^(22i°g^^^)(i°g^^^)'^+^'' ) ^ 



V (exp((logrA^)20+i)!)) ^ V (exp(tJi(logrA^))), or in other words ^A<J>^ ^ V (tJ2(^A^)). 



The reason that Theorem 20 is stated for non-standard A is that the set A^°°\ needed for constructing 
the model t\l{A,p), is not definable in Ca- But the existence of the definable A^-'^ for a non-standard j can 
guarantee the existence of A'°°' and thus of tYl(A,p). This non-standard j exists for non-standard '"A"'. 

3.2 The Main Theorem for IAq + Qi 

Two interesting theorems were proved by Z. Adamowicz in [3] about Herbrand Consistency of the theories 
IAq + fim for rn ^ 2: 

Theorem 21 (Z. Adamowicz [3J) For a bounded formula 6{x) and m ^ 2, if the theory (IAq + fim) + 
3xEiog"^'^^9(x) + HConfojm-2(IAo + ^m) is consistent, then so is the theory (IAq + flm) + ^x^&)g"^~^'^9{x), 
where HConfopm-2 is the relativization o/HCon to the cutiog"^'"^. O 

Theorem 22 (Z. Adamowicz [|3j) For any natural m,n ^ there exists a bounded formula rjix) such 
that (IAq + fim) + ^xEiog'^rjix) is consistent, but (IAq + fim) + ^xE&ig^~^^ri(x) is not consistent. O 

These two theorems (by putting n = m -\- 1 ioi m ^ 2) imply together that for any m ^ 2 : 

IAq + ^mV HConfogm-2(IAo + ^m)- 



Here we extend Theorem 21 for IAq + fii, namely we show that 



Theorem 23 For any bounded formula 6{x), the consistency of the theory (IAq + fii) + 3x&iog'^6{x) + 
HCon(lAo + f2i) implies the consistency of the theory (IAq + fii) + 3xE(!og^9{x). 



The rest of this section is devoted to proving this theorem. Let us note that Theorem [22] holds already 
for IAq + ^1, and below we reiterate the part that we need here: 

Theorem 24 (Z. Adamowicz [i3j) There exists a bounded formula rjix) such that the arithmetical theory 
(IAq + fii) -|- 3xEiog'^ri(x) is consistent, but (IAq -\- fii) -\- 3xEiog^ri(x) is not consistent. O 



Having proved the main theorem (23), we can immediately infer that 



lAo + ni\/ HCon(lAo + Qi 



As the proof of Theorem 23 is long, we will break it into a few lemmas. First we note that a G iog^ if and 
only if there exists a sequence {wq, Wi, ■ ■ ■ , Wa) of length (a + 1) such that Wq = exp^(O) = 2^, and for any 
j < a, Wj+i = 0Ji{wj). Noting that a;i(exp^(j)) = exp^(j + 1) one can then see that Wa = exp'^(a), and so 
a^Sog^. This can be formalized in IAq + fii by an arithmetical formula. Note that the code of the above 
sequence is bounded by '^(117=0 '"^i) ^ '^(6^P(1^7=o 6xp^(j))) ^ V (exp^(a -\- 1)) ^ V (ci;i(exp^(a))). So, 
in the presence of Qi, the existence of exp^(a) guarantees the existence of the above sequence of w/s. 

^Sa&fe"^'" http://saeedsalehi.ir/pdf/licon.pdf (c) Saeed Salehi 2010 



page 12 (of 20) Herbrand Consistency of Some Arithmetical Theories 



For proving Theorem 23 let us assume that we have a model 

M h (lAo + fii) + {aeiog^ A e{a)) + HCon(lAo + fii), 
for some bounded formula 6{x) and some non-standard aG A^, and then we construct a model 

AT 1= (lAo + ni) + 3xe£og^e{x). 

If our language of arithmetic Ca contains the successor function s, then define the terms j's by in- 
duction: = 0, and j + 1 = s(j). If Ca does not contain s, then it should have the constant 1, and in 
this case we can put j + 1 = j + 1. The term j represents the (standard or non-standard) number j. 
For the sake of simplicity, assume it) denotes the Skolem function symbol f3y{y=uj-i^{x))- Put wq = 4 and 
inductively Wj+i = xv{\Nj). Then w^, in the theory IAq + fli, is the term which represents exp^(A;). Finally, 
put A = {0, . . . ,Ci;i(a;), Wo, . . . , Wq,} = {j \ j ^ uji{a)} U {vjj \ j ^ a}. We can now estimate an upper 

bound for the code of A: ^A^ ^ V (11^=^^"^ 2^') ^ V {exp{ui{af)) . 

So A has a code in JH (since Ai \= a E iog'^), and moreover u;2('~A~') exists in A^, because aj2('~A~') ^ 
V {U2{exp{uji{af))) ^ V (exp(a;i(a;i(a)2))) ^ V (exp^ (4(loga)^)) ^ V (exp2(a)). 



Thus by Theorem 20 there exists a non-standard j such that A'-'' has a code in At. Since by the assumption 



above we have Al |= IICon(IAo + fii), then there exists an (IAq + fii)— evaluation p on A^-'^ (in Al). Now, 



by what was said after the proof of Theorem 20 one can construct the model nT(A,p) = M . By Lemma 18 
we have M |= (IAq + f^i), and also M |= a/pElog^ follows from the existence of Wj/p's. It remains (only) 
to show that nT(A,p) ^ 9{a/p). 

For this purpose we prove the following lemmas where we assume that M. is as above and there are 
some non-standard set of terms and evaluation A,p in Al such that A D {0, . . . , uji{q)} for a non-standard 
aG Al, and p is an IAq— evaluation on A^°°^ 

Lemma 25 //lTI(A,p) |= tjp ^ ijp holds for a term t and i ^ oJi^a) in Al, then Xn{A,p) \= t/p = j/p 
for some j ^ i. 

Proof. By the assumption Al |= "p |= t ^ i". We prove by induction on i that there exists some j ^ i in 
M such that M\= ''p\=t = f . 



For i = by Example 11 the assumption Al |= "p |= t ^ 0" implies Al |= "p |= ^ = 0", noting that 



p is an Q— evaluation on A^'^^Vand thus all the needed Skolem terms are in p's disposal. 



• For 2 + 1 we have Al |= "p |= t ^ i\/t = s(i)" by Example 11 and the assumed satisfaction 
M \= ''p \= t i^ •s(z)". Then if Al |= "p \= t = s{i)" we are done, and if Al |= "p |= t < i" by the induction 
hypothesis there must exist some j ^ i in Al such that Al |= "p |= ^ = j" ■ @ 



Remark 26 The proof of the above lemma does not depend on the axioms of Q (and IAq). Indeed, in 
some axiomatization of Q in the literature, the sentences 



ijj = \/x{x ^ — 7- a; = 0) and </? = \/x\/y[x ^ sy ^ x = syV x ^ y) (see Example 11 ) 
are accepted as axioms. In our axiomatization, the above sentences were derivable theorems. In some 
axiomatizations of Q our axiom A4 is replaced with A'^ : \/x, y{x ^ y ^ ^z[z + x = y]); note the difference 
oi X + z in A4 and z + x in A'^ (see e.g. [7]). In this new axiomatization the sentence ip is not derivable. 



However, since we have IAq \- ip A (f, then by the argument of Remark 13, the above Lemma 25 can be 



proved by using the fact that p is an IAq— evaluation on A^°^^ O 
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Lemma 27 For any CA—term t{xi, . . . , x^) and ii, . . . ,1^ ^ UJi{a), if Ai \= x ^ t{ii, . . . , i^) for some 
X, then for an CA—term t'{xi, . . . ,Xk) and some ji, . . . ,jk ^ uJi{a) we have Ai \= x = t'(ji, . . . ,jk)- 

Proof. By induction on (the complexity of) the term t. 

• For t = and t = xi the proof is straightforward. 

• For t = 5u the assumption M. \= x ^ su{ii, . . . , im) implies that either A^ |= x = su{ii, . . . , im) or 
A4 \= X ^ u{ii, . . . , im) is true, and then the conclusion follows from the induction hypothesis. 

• For t = u + v, and the assumption Ai \= x ^ u{ii, . . . , im)+v{,ii, • • • , im), we consider two cases. First 
if A^ 1= ^ ^ w(zi, . . . , im) then we are done by the induction hypothesis. Second if A^ |= u{ii, . . . , im) ^ x 
then there exists a y such that Ai \= x = u{ii, . . . , im) + y and moreover Ai \= y ^ f (ii, . . . , im)- Now, by 
the induction hypothesis there are a term t'{xi, . . . ,Xk) and some elements ji, . . . ,jk ^ (^i(«) such that 
Ai \= y = t'{ji, . . . ,jk). Whence we finally get the conclusion Al |= x = u{ii, . . . ,im) + t'{ji, . . . ,jk). 

• For t = M-f , by an argument similar to that of the previous case, we can assume Al |= u{ii, . . . , im) ^ 
X ^ u{ii, . . . ,im) ■ v{ii, . . . ,im)- There are some q,r such that Al |= x = u{ii, . . . ,im) -q + r and 
A4 \= r ^ u{ii, . . . ,im)- We also have A^ |= g ^ v{ii, . . . ,im)- By the induction hypothesis there are 
terms t',t" and ji, . . . ,jk ^ Wi(a) such that Ai \= q = t'(ji, . . . ,jk) A'^ = t"{ji, ■ ■ ■ ijk)- Thus we finally 
have Al 1= X = u{ii, ...,im)- t'{ji, . . . ,jk) + t"{ji, . . . ,jk)- © 



Lemma 28 For i,j, k ^ oJiia) in Ai we have 

(1) ifi^j^ uJi{a) then )M{A,p) \= i/p ^ jjp ; 

(2) if i + j ^ uJi{a) then nT(A, p) \= i/p + j/p = i + j/p ; 

(3) if i ■ j ^ Ui (a) then nT(A, p) \= i/p ■ j/p = i ■ j/p ■ 

Proof. We need to show for the i,j ^ 0Ji{a) that 

(1) ii M^i^j then M^''p^i^f\ 

(2) if Al 1= i + J ^ wi(a) then Al |= "p |= i + j = i + j" , and 

(3) ii Ai^i-] ^ uji{a) then Al ^ "p |= i ■ j = i-f . 

First we note that the statement (2) above implies already (1), since if we have Al |= i ^ j, then for some 
k we should have Al \= i + k = j^ and then by (2), Al |= "p |= i + fc = j" which implies (by y44 of Q - 

one 



see Example 11) that Al |= "p |= i ^ j". By induction on j, very similarly to the proof of Lemma 



25 



can prove the statements (2) and (3), noting that the evaluation p must satisfy the following axioms of Q 

As : Vx(x + = x); Aq : VxVy(x + sy = s(x + y))] 

A-j : Vx(x -0 = 0); A^ : VxVy(x ■ sy = x ■ y + x). 



Corollary 29 Suppose for an CA—term t(xi, . . . ,Xm) and some elements ii, . . . ,im,'>' ^ ^i{<^), we have 
Al 1= t(zi, . . . ,im) = i- Then we must also have Vd^K.p) \= t{ii/p, . . . ,im/p) = i/p- 



Proof. By induction on t using Lemma 28 
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Lemma 30 Suppose t{xi, . . . , Xm),t\xi, . . . , Xm) are two Ca— terms and ii, . . . ,im ^ a^ are elements of 
Ai for some standard number /c G N. Then, if Ai \= t{ii, . . . ,1^) = t'{ii, . . . ,1^) holds, l\l{A,p) \= 
t{ii/p, . . . , im/p) = t'iii/v^ • • • ) im/v) rnust hold too. 

Proof. By ii, . . . , z^ ^ a'^ we have t(ii, . . . , im) ^ uji{q). Put i be the common value i = t(zi, . . . , im) = 



t'{ii, ..., im)- Then By Corollary 29 we have iTT(A,p) |= t{ii/p, . . . , im/p) = i/p = t'{ii/p, • • • , im/p)- ^ 



Lemma 31 Suppose t{xi, . . . , Xm),t'{xi, . . . , Xm) are two Ca— terms and ii, . . . ,im ^ a'' are elements of 
Ai for some standard number k&N. If we have A4 \= t{ii, . . . ,im) ^ t'{ii, . . . ,im) then we must also have 
the satisfaction X\l{A,p) \= t{ii/p, . . . , im/p) ^ t'{ii/p, ■ ■ ■ , im/p)- 

Proof. Noting that Q h Va;, y(a; ^ y -v^ 3z{x + z = y)) by the assumption there exists an (3EAi such 



that A^ 1= t{ii, . . . , im) + /3 = t'{ii, . . . , im)- On the other hand Ai \= j3 ^ t'iii, - - - , im), so by Lemma 27 
there exist a term u and some ji, . . . ,jk ^ 0Ji{o:) such that Ai \= (3 = u{ji, . . . ,jk)- Thus the equality 
t{ii,...,im) +u{ji,...,jk) = s{ii,---,im) holds in At- Now, by Lemma |30} 

m(A,p) \= t{ii/p, . . -,im/p) + u{ji/p, . . . ,jk/p) = t'{ii/p, . . -,im/p), 
whence t{ii/p, . . . , im/p) ^ t'ih/p, - - - , im/p) is satisfied in in(A, p). O 



Lemma 32 Suppose t{xi, . . . , Xm),t'{xi, . . . , Xm) are two Ca— terms and ii, . . . ,im ^ a^ are elements of 
AI for some standard number kEN. If it is true that A\ \= t{ii, . . . , im) 7^ ^'(^i, • • • , im) then nT(A, p) \= 
t{ii/p, . . . ,im/p) 7^ t'{ii/p, . . . ,im/p) must be true too. And if Ai \= t{ii, . . . ,im) ^ t'{ii, . . . ,im) then 
m(A,p) \= t{ii/p, . . -,im/p) ^ t'{ii/p, . . -,im/p)- 



Proof. It follows from Lemma 31 (and Remark 13) noting that p is an I Aq— evaluation on A^°°^ and 

lAo \- \/x,y[x 7^ y < — > sy ^ x \/ sx ^ y), and 

IAq \- Vx, y[x ^y < — y sy ^^ x). @ 



Theorem 33 Suppose iplxi, . . . ,Xm) is an open RNNF Ca— formula and ii,...,im ^ a^ are elements 
of A^ for some standard number k eN. If we have AI |= ip{ii, - - - ,im) then we also have X\l{A,p) \= 
il}{ii/p,...,i_mlp)- 



Proof. Lemmas 30 and [31] prove the theorem for atomic formulas, and Lemma 32 proves it for negated 
atomic formulas. For the disjunctive and conjunctive compositions of those formulas one can prove the 
theorem by a simple induction. g) 



Theorem 34 Suppose that (^^(xi, . . . iXm) is a bounded Ca— formula and that ii, . . . ,im ^ a'^ are elements 
of Ai for some standard number kEN. If Ai \= (p{ii, . . . ,im) thenXYl{A,p) \= ip{ii/p, . . . ,im/p)- 



Proof. Every bounded formula can be written as an (equivalent) RNNF formula. By Lemma 27 the range 
of bounded quantifiers of a formula whose all parameters belong to the set 

{t{ii, . . . , im) \ ii, - - - , im ^ C( ^ t is an Ca — term} 
is indeed that set again. Now the conclusion follows from Theorem [33} 
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► An alternative proof: To make this important theorem more clear, we sketch another proof, which is 
not really too different but has more model-theoretic flavor. Consider the above set again 

{[0,a])M = {t{ii,- ■ ■ ,im) \ H, ... ,im ^ a ^ t is an Ca -term} 
which is a subset of Ai closed under the successor, addition, and multiplication, and thus forms a submodel 



of Ai (generated hy [0, a] = {x E Ai \ x ^ a}). This submodel is an initial segment of Ai by Lemma 27 
Hence, whenever At \= (p, for a bounded formula ip with parameters in [0, a], then ([0, a])M 1= 'P- 

Now, similarly, the set 

([Q/p, a/p])j^ = {t{ii/p, ..., ijn/p) I ii, . . . , im ^ a & t is an £a - term} 

is an initial segment and a submodel of A/" = V(l{K,p). Thus if ([0/p, a/p])^/' |= v^, where y? is a 
bounded formula with parameters in [0/p, a/p], then nT(A,p) |= ip. Finally, we note that the map- 
ping t(zi, . . . , im) >— !■ t{ii/p, . . . , im/p) deflnes a bijection between ([0, a])M and {[0/p, a/p])^ which is also 
an isomorphism by Lemmas 30} 31 and 32 So the proof of the theorem goes as follows: 



If A^ 1= ipiii, . . . ,im) then ([O,^])^^ |= v3(ii, . . . ,i„), so ([0/p, a/p])^^ [= v2(h/p, • • • , Wp) hence 

^(A,p) h </'(«i/P' • • -^hn/p). O 

Corollary 35 By the above assumptions, in(A,p) |= 6{a/p). O 
Let us summarize what was argued in the last few pages. 



Proof. (Of Theorem 23,) By the assumption of the theorem, the theory (IAq + ^i) + 3a; Eiog 9{x) + 
HCon(lAo + f^i) is consistent. So there is a model 

M 1= (lAo + ni) + {aeiog^ A e{a)) + HCon(lAo + fii), 

where a E At. We wish to show the consistency of (IAq + f^i) + 3a:; G iog^9{x) by constructing another 
model 

Af 1= (lAo + fii) + 3xEiog^e{x). 

If a is standard (i.e., aGN) then one can take A/" = At. But if «G A^ is non-standard, then we proceed as 
follows: Take A to be the following set of terms: A = {j | j ^ u)i{a)} U {wj | j ^ a} in which the terms j's 
and Wj's are deflned inductively as = 0, j -f- 1 = sj; and wq = 4, Wj+i = wiyjj). Here s is the successor 
function, and tr denotes the Skolem function symbol f3y{y=uji{x)). Now a;2('~A~') is of order (far less than) 



2 which exists by the assumption A4 |= a Eiog . Then by Theorem 20 for a non-standard j the set of 
terms A^-'^ has a code in A4. Thus the assumption Ai |= HCon(IAo + f2i) implies that there must exists 
an (IAq + fii)— evaluation p on A^-'^ Then one can form the model J\f = tYl(A,p). Now J\f \= IAq + fii 



by Lemma 18, and also A^ |= a/pEiog^ by the deflnition of w^. Finally, Af \= 9{a/p) by Corollary 35 (of 



Theorem 34). Whence A/" is a model of the theory (IAq -|- fli) + 3xEiog^6{x); and this flnishes the proof 



of its consistency. 



4 Herbrand Consistency of IAq 

Our definition of Herbrand consistency is not best suited for IAq: there are a;i('"A~')— many evaluations 
on a given set of terms A. Though this may not seem a big problem in the first glance (one can change 
or modify the definition accordingly) but special care is needed for generalizing the results to the case of 
IAq. In the first subsection we pinpoint the critical usages of Qi and in the second subsection we tailor 
the definitions and theorems in a way that we can prove our main theorem for IAq finally. 
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4.1 Essentiality of Q 



1 



We made an essential use of Qi in the following parts of our arguments: 

1- The totality of the ui function was needed for the upper bound of the code of an evaluation on a 



given set of terms A. Namely, the code of any evaluation on A is of order ci;i('"A~'), see Lemma 16 And 
indeed there is no escape from this bound since, as it was explained after Lemma 16, there are exp(2|Ap) 



evaluations on A, and if |A| ^ log'^A^ then there could exist as many as WiC^A"')^ evaluations on A. So, 
if Qi is not available, then there could be a large and non-standard set of terms F in a model Ai such 
that Ai cannot see all the evaluations on F. One of those evaluations could be a T— evaluation, that an 
end-extension of Ai, say /C, can see. Then F is a Herbrand proof of contradiction in Ai because in A^'s 
view there is no T— evaluation on F. But there could be indeed a very large T— evaluation on F which A4 
could not see, but /C can. Thus the definition of HCon is deficient for IAq (where Qi is not there) and 
one cannot consider all the set of terms; those for which the Ui of their codes exist, should be considered 
instead. 

2- The second critical use of Qi was in the definition of Wj's for shrinking the (double-) logarithmic 
witness Ai \= a E iog^ to A/" |= w^/p e £og^. There we constructed the sequence (wq, . . . ,Wq,) of terms 
such that Wo = 4 and Wj+i = tti(wj) where tv is the Skolem function symbol f3y[y=u}i{x)]- And this was 
in our disposal because Qi = \/x3y[y = ui{x)] was one of the axioms (of IAq + fii) and thus every 
(IAq + i^i)— evluation must have satisfied tti(t) = Ui(t). 

Note that we also required A to contain {j \ j ^ u)i{a)}, but for this we did not need the existence of 
Ui{a); it was guaranteed by the assumption AA \= aEiog^. 

4.2 Tailoring for IAq 

Here we introduce the necessary modifications on the above two points. 

4.2.1 The Definition of HCon* 

The first point can be dealt with by tailoring the definition of HCon for IAq: 

Definition 36 A theory T is called Herbrand Consistent*, denoted symbolically as HCon*(T), when for 
all set of terms A, if ui{'~A~^) exists then there is an T— evaluation on A. O 

This, obviously, can again be formalized in the language of arithmetic. The new definition cannot harm 
our arguments too much, because we needed HCon only for some special set of terms. And it was A^-'^ 
for a non-standard j where A = {j | j ^ ui{a)} U {w^ | j ^ a}. For constructing the model in(A,p) we 



already needed the existence of (X'2('~A~') (see the beginning of the proof of Theorem 23 before Lemma 25 ). 
Thus if we require the existence of Ci;i('~A~') in the definition of HCon*, then we will need the existence of 
072 ('"A"') later in the proof! Thus the first deficiency can be overcome. 

4.2.2 The Cuts I and J 

In the absence of Qi we cannot define the above sequence (wq, . . . , Wo) satisfying Wj+i = ui{\Nj). The 
most we can do inside IAq is to define a sequence like {vq, . . . , Vjs) where Vq = m and Vj+i = (f j)" for some 
fixed 171,11 eN. Then f^ = a^^ ^ P(exp^(/3)). Thus we cannot get anything larger than exp^, and so for 
shortening a witness we should start from iog and remain in the realm of iog'^. Indeed by the arguments 
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of the beginning of the proof of Theorem 23 before Lemma 25 we did not need the existence of exp (a 



for the existence of a;2('~A~'). We needed only exp^ (4(log Q;)^j^^Thus it seems natural to consider the cut 
I = {x \ 3y[y = exp^ (4(loga)'^)]} and its logarithm J' = {x \ 3y[y = exp^ (4a'^)]}. We first note that 
Adamowicz's theorem (Theorem 22) holds for IAq and any n eN; i.e., there exists a bounded formula 
whose fog'"— witness cannot consistently be shortened to iog"''^^. Indeed this theorem holds for any cut I 
and its logarithm which is definition the cut J = {x \ 3y[y = exp(x) Ay El]}. The only relation between 
(og"^ and (og"''^^ needed in the proof of Theorem 21 is that 2^ Eiog^ <^=^ xGfog'""'"^; see [3j. And the proof 
works for any cut / and J which satisfy Vx(2^ e / <^=^ xeJ). The cuts X and J defined above satisfy 
this as well (exp(x) gX <^=^ xEj). So, we repeat Theorem 21 



as: 



Theorem 37 ([.3J) There exists a hounded formula ri{x) such that the theory IAq + 3xElri{x) is consis- 
tent, but IAq + 3xEj'rj{x) is not consistent. O 



4.2.3 The Main Theorem for lAn 



Let us note that the following theorem together with Theorem 37 prove that IAq 1/ HCon*(IAo). 



Theorem 38 For any bounded formula 9{x), if the theory IAq + 3xeZ9{x) + HCon*(lAo) is consistent 
then so is the theory IAq + 3xEj'9{x). 

Proof. Suppose the theory IAq + 3xEl9{x) + HCon*(lAo) is consistent. So there exists a model 

M^lAo+ {aeXA9{a)) +HCon*(IAo), 
where aEM.. We will show the consistency of IAq + 3xEj9{x) by constructing another model 



If a is standard (i.e., aGN) then one can take N = M.. But if aEAi is non-standard, then we proceed as 
follows: 

Let T = {0,0 + 0,0^ 
formula 3y{y ^ x"^ A y 



^2 c c^ 



c^ + 0,sc, qc, (sc)^, (sc)^ + 0} where q is the Skolem function symbol for the 



X 



and c is the Skolem constant symbol for the sentence (see Example 12 ) 



3x(3w{w ^ x"^ Aw = x"^) A \/v{v ^ (sx)^ Av ^ (-sa^)^)) • 



We can use the argument of Example 12, since for the bounded formula iplx) = 3y ^ x'^{y 



X ■ x) 



the sentence ind^ is an axiom of the theory IAq. Take A = 
which the terms j's and z^-'s are defined inductively as = 0, 

W2( 



A"') is of order exp^ (4(loga)^) which exists by the assumption A^ |= aGX. Then by Theorem 



T U {j I j ^ coiia)} U {zj I J ^ 4a*} in 
j + 1 = sr, and zo = 2, zj+i = q(zj). Now 

for 



20 



a non-standard j the set of terms A^-'^ has a code in A^. Thus the assumption A^ |= HCon*(IAo) implies 
that there must exists an IAq— evaluation p on A^-'^ Then one can form the model J\f = tYl{A,p). Now 



J\f \= IAq by Lemma 18, and also A^ |= a/pE J by the definition of Z4q4 (which represents exp^(4a'^)). 
Note that p |= Zj+i = Zj -z^ by the argument of Example 12 , and also the code of the sequence (zq, . . . , Z4a4) 



is of order exp ((4a^)^) ^ exp^(4(loga)^) which exists since aGX. Finally, M |= 9[a/p) by Corollary 
(of Theorem [34|) . Whence A/" is a model of the theory IAq + 3xEj9{x)] what proves its consistency. 



35 



rT,ated . 
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5 Conclusions 

An important property of Herbrand consistency of the theories IAq + f^i and IAq has been proved. That 
property immediately implies Godel's second incompleteness theorem for the notion of Herbrand consis- 
tency in those theories. However, this version of Godel's theorem has come a long way. The original 
presumption of Paris & Wilkie [TU] asked for a proof of IAq 1/ CFCon(lAo), without specifying any vari- 
ant of Cut-Free Consistency CFCon: "Presumably IAq 1/ CFCon(IAo) although we do not know this at 
present". Willard [H] solved this problem for the Tableau Consistency variant. Pudlak [11] asked a more 
specific question: "we know only that T \f HCon(T) for T containing at least IAq + Exp, for weaker 
theories it is an open problem". In |13j this problem was studied for the theories IAq -|- f^i and IAq (and 
a theory in between these two, namely IAq plus the totality of the x i— )■ x^°^ ^ function). The proof of 
IAq -|- fii 1/ HCon(IAo -|- fii) given here was presented for the first time in Chapter 5 of [13]. But the 
unprovability of HCon(IAo) in IAq was not as easy as it would have seemed. In Chapter 3 of [13] this 
unprovability was proved for a re-axiomatization of IAq. 

Our reason for using the induction formula ind^, where '?/'(a;) is the bounded formula 3?/ ^ x^iy = x"^), 
was having an Skolem function symbol for squaring q{x) = x^. This way the Godel code of q(x) is M ■ '"x"' 
for a fixed M eN, and thus the code of q"(x) is M" ■ '"x"' which is of order exp(n). So, we could code a 
term representing the number x'^^^*^"-' (=q"'(a;)) by a number of order exp(n). But if we coded the number 
j.exp{n) fiij-gctly, that would be the code oi x ■ x ■ . . . ■ x (with 2"' — times x) which is of order ('"x"')^" or 
exp^(n). In that case, the code of the sequence (zq, . . . ,740,4) would be of order exp^((4a^)^), but we used 



the order exp ((4a^)^) in the proof of Theorem 38 (since we had at most exp^(4(logQ;)^) in our disposal 
which is far less than exp^((4a'^)^)). That way, we avoided accepting the totality of the squaring function 
Qq : \/x3y{y = x ■ x) as an (additional) axiom. 

This point deserves another look: define the terms {zj}, {uj}, and {vj} inductively as zq = 2, z^+i = 
q(zj); uo = 2, Uj+i = (zj)^; and vq = 2, v^+i = (v^)^. Then the codes of the terms z„'s and v„'s are of order 
7^(2"-), but the code of v„'s are of order P(2^"). On the other hand, the terms Zj, Uj and Vj have the same 
value (2^*) in any model of IAq. In fact, for i ^ coi{a) we have Zj G A and also Uj G A^^^; but Vj's are too 
big to fit in small sets of terms. 

Our treatment of Godel's second incompleteness theorem for Herbrand consistency in weak arithmetics, 
can be summarized in the following improvements to the classical treatments (cf. the first paragraph of 
Appendix E in [15 ]): 

(1) For Skolemizing a formula we did not transform it to a prenex normal form. This allowed a more 
efficient Skolemization and Herbrandization of formulas. 

(2) Propositional satisfiability was achieved by evaluations, which are partial (Herbrand) models; see also 
[2l[3lli|8l[T2l[l3]. 

(3) For logarithmic shortening of bounded witnesses in IAq, we could not go from &)g to iog"^ directly. 
Instead we used the condition c<;^(x)'^ G £og (equivalently x gX) to get to 4x'' G iog^ (equivalently x G JT"). 



For that we used the improved version of Adamowicz's theorem [3] (Theorem 37). 

(4) And finally, we used the trick of ind^ to get an Skolem function symbol for the squaring function. 
Ideally, one would not use any induction axiom for proving a formula like Qo : ^x-^y{y = x^). This is an 
Q— derivable sentence, and adding it as an axiom seems much more natural than proving it by an inductive 
argument. But, fortunately, there was a way of avoiding the acceptance of Qq as an axiom, and that was 
proving its Hi— equivalent \/x3y ^ x^{y = x^) by induction on its bounded part 3y ^ x'^{y = x^) (see 



Example 12 and the proof of Theorem 38). That induction axiom could give us a free Skolem function 



symbol for the squaring operation, provided that we did not prenex normalize the induction axiom, and 
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instead Skolemize it more effectively — see point (1) above. Prenex normalizing and then Skolemizing the 
induction axioms can be so cumbersome that many would prefer avoiding them, but accepting new axioms 
instead! Trying to prenex normalize the induction axiom ind^ for ip = 3y ^ x'^{y = x^) in Example 12 
can give a hint for its difficulty. 



In the end, we conjecture that by using our coding techniques and definitions of Herbrand consistency, the 
results of L. A. Kolodziejczyk |8] can be generalized for showing the following unprovability: 

Conjecture 39 U„(lAo + l^n) ^HCon*(lAo). 

Question 40 Can a Book proof (in the words of Paul Erdos) be given for Godel's second incompleteness 
theorem T l/'HCon(T) for any theory T 3 Q and a canonical definition of Herbrand consistency "HCon? 
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